Register user's favorite site (S101)
Crawl sites and prepare metadata (S102)
Extract new keywords (S103)
Calculate important keywords (S104)
Extract sentence-level important information element set (S105)
Extract topic keywords (S106)
Extract word-level important information elements (S107)
Apply and visually present extracted important information elements (S108)

Example registered site

<site>
<url>http://www.ibm.co.jp/</url>
<name>IBM Japan Homepage</name>
</site>
<site>
<url>http://www.ibm.com/</url>
<name>IBM Corporation</name>
</site>
<site>
<url>http://www.jp.ibm.com/shop/</url>
<name>IBM Japan, "Shopping"</name>
</site>
<site>
<url>http://www.jp.ibm.com/developerworks/</url>
<name>developerWorks</name>
</site>
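
A registered-site list in this form can be read with any XML parser. The following is a minimal sketch in Python, assuming the `<site>` entries are wrapped in a single root element. The figure shows no such wrapper, so `<sites>` is a hypothetical addition, and the function name `load_sites` is chosen only for illustration.

```python
import xml.etree.ElementTree as ET

# Two entries copied from the figure. The <sites> wrapper is an assumption,
# added only because an XML document needs a single root element.
REGISTERED = """
<sites>
  <site>
    <url>http://www.ibm.co.jp/</url>
    <name>IBM Japan Homepage</name>
  </site>
  <site>
    <url>http://www.ibm.com/</url>
    <name>IBM Corporation</name>
  </site>
</sites>
"""


def load_sites(xml_text):
    """Return a list of (url, name) pairs, one per <site> element."""
    root = ET.fromstring(xml_text)
    sites = []
    for site in root.findall("site"):
        url = site.findtext("url", default="").strip()
        name = site.findtext("name", default="").strip()
        sites.append((url, name))
    return sites


if __name__ == "__main__":
    for url, name in load_sites(REGISTERED):
        print(name, url)
```

The resulting list of URLs is the kind of input a crawl step would need. How the crawl itself is carried out is not shown in the figure and is not sketched here.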

20  Metadata preparation mechanism
21  Information element extraction mechanism
22  Attribute extraction mechanism
23  Morphological analysis mechanism
24  Keyword extraction mechanism
25  Keyword categorization mechanism
Input file (metadata)

Example link

Expression in an HTML file

<a href="http://www.ibm.com/jp/software/data/udb/v7/seminar.html">DB2 UDB V7 open free seminar</a>

Extracted information elements

<anchor>
// title of a link
<DCtitle>DB2 UDB V7 open free seminar</DCtitle>
// URL
<url>http://www.ibm.com/jp/software/data/udb/v7/seminar.html</url>
// extracted keywords
<kwds>
// keywords // categories of keywords
<kwd><word>DB</word><class>T0</class></kwd>
<kwd><word>UDB</word><class>T0</class></kwd>
<kwd><word>V</word><class>T0</class></kwd>
<kwd><word>seminar</word><class>13</class></kwd>
<kwd><word>open</word><class>13</class></kwd>
<kwd><word>free</word><class>13</class></kwd>
</kwds>
</anchor>



Fig. 5 
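
The mapping from an HTML link to the `<anchor>` element of Fig. 5 can be sketched as follows. This is a minimal illustration in Python using the standard `html.parser` and `xml.etree.ElementTree` modules, not the mechanism of the figure itself. The names `AnchorCollector`, `extract_anchors`, and `anchor_element` are hypothetical, and the `<kwds>` part is left out because it depends on a morphological analysis step that is not reproduced here.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class AnchorCollector(HTMLParser):
    """Collect (href, link text) pairs from <a> elements."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_anchors(html_text):
    """Return every (href, title) pair found in the HTML text."""
    collector = AnchorCollector()
    collector.feed(html_text)
    collector.close()
    return collector.links


def anchor_element(href, title):
    """Build an <anchor> element holding the link title and URL."""
    anchor = ET.Element("anchor")
    ET.SubElement(anchor, "DCtitle").text = title
    ET.SubElement(anchor, "url").text = href
    return anchor


if __name__ == "__main__":
    page = ('<a href="http://www.ibm.com/jp/software/data/udb/v7/seminar.html">'
            'DB2 UDB V7 open free seminar</a>')
    for href, title in extract_anchors(page):
        print(ET.tostring(anchor_element(href, title), encoding="unicode"))
```

Links without an `href` attribute are skipped in this sketch, since they would yield no `<url>` value.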



JP9-2000-0421-JP1
6/15 



Example text block

Expression in an HTML file

industry talk / this week's e - column

Extracted information elements

<text>
// text portion
<DCdescription>industry talk / this week's e - column</DCdescription>
<kwds>
<kwd><word>e</word><class>T0</class></kwd>
<kwd><word>industry</word><class>T0</class></kwd>
<kwd><word>column</word><class>11</class></kwd>
<kwd><word>talk</word><class>13</class></kwd>
<kwd><word>this week</word><class>11</class></kwd>
</kwds>
</text>



Fig. 6 
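
Once an element such as the `<text>` block of Fig. 6 has been extracted, its keywords can be grouped by category for later steps. The sketch below is one way to do this in Python and rests on assumptions: the class codes (`T0`, `11`, `13`) are transcribed from the figure without any claim about their meaning, and the function name `keywords_by_class` is hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Extracted information element transcribed from the figure.
# The class codes are copied as read; their meaning is not defined here.
TEXT_ELEMENT = """
<text>
  <DCdescription>industry talk / this week's e - column</DCdescription>
  <kwds>
    <kwd><word>e</word><class>T0</class></kwd>
    <kwd><word>industry</word><class>T0</class></kwd>
    <kwd><word>column</word><class>11</class></kwd>
    <kwd><word>talk</word><class>13</class></kwd>
    <kwd><word>this week</word><class>11</class></kwd>
  </kwds>
</text>
"""


def keywords_by_class(xml_text):
    """Group the <word> values of all <kwd> entries by their <class> code."""
    root = ET.fromstring(xml_text)
    groups = defaultdict(list)
    for kwd in root.iter("kwd"):
        word = kwd.findtext("word", default="").strip()
        code = kwd.findtext("class", default="").strip()
        groups[code].append(word)
    return dict(groups)


if __name__ == "__main__":
    for code, words in keywords_by_class(TEXT_ELEMENT).items():
        print(code, words)
```

Because the function iterates over every `<kwd>` below the root, the same call works on an `<anchor>` element with the same `<kwds>` layout.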